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Abstract 



The random assignment (or bipartite matching) problem asks about 
A n = min n X^ILi c (*> tt(*))» wnere ( c (*>j)) is a n x n matrix with i.i.d. 
entries, say with exponential 1) distribution, and the minimum is over 
permutations ir. Mezard and Parisi (1987) used the replica method from 
statistical physics to argue non-rigorously that EA n — > C(2) = 7r 2 /6. Al- 
dous (1992) identified the limit in terms of a matching problem on a limit 
infinite tree. Here we construct the optimal matching on the infinite tree. 
This yields a rigorous proof of the £(2) limit and of the conjectured limit 
distribution of edge-costs and their rank-orders in the optimal matching. 
It also yields the asymptotic essential uniqueness property: every almost- 
optimal matching coincides with the optimal matching except on a small 
proportion of edges. 

Key words and phrases. Assignment problem, bipartite matching, cavity 
method, combinatorial optimization, distributional identity, infinite tree, prob- 
abilistic analysis of algorithms, probabilistic combinatorics, random matrix, 
replica method. 

AMS subject classifications. 05C80, 60C05, 68W40, 82B44. 



*Research supported by N.S.F. Grant DMS99- 70901 



1 



1 



Introduction 



1.1 Background 

Consider the task of choosing an assignment of n jobs to n machines 
in order to minimize the total cost of performing the n jobs. The 
basic input for the problem is an n x n matrix (c(i,j)), where c(i,j) 
is viewed as the cost of performing job i on machine j, and the 
assignment problem is to determine a permutation ir that solves 

n 

A n = min^c(i,7r(i)). 
n i=l 

No doubt the simplest stochastic model for the assignment prob- 
lem is given by considering the c(i,j) to be independent random 
variables with the uniform distribution on [0,1]. This model is 
apparently quite simple, but after some analysis one finds that it 
possesses a considerable richness. Steele Whl - 

Part of this richness is that there have been four different approaches to the 
study of the random assignment problem, which we shall describe briefly See 
for further history. Our focus is upon mathematical properties of A n 
and the minimizing 7T, rather than algorithmic questions. 

(a) Rigorous bounds via linear programming. Steele (|2^] Chapter 4) goes on 
to give a detailed account of results of Walkup [K5] , Dyer - Frieze - McDiarmid 



|H and Karp |13|] which lead to the upper bound EA n < 2. The lower bound 
limsup n £7i n > 1 + e _1 was proved by Lazarus [[U| and subsequent work of 
Olin |2^] and Goemans - Kodialam (ll]] improved the lower bound to 1.51. 
Recently Coppersmith - Sorkin [[jj improved the upper bound to 1.94. 

(b) The replica method. Mezard - Parisi |2l]] gave a non-rigorous argument 



based on the replica method [22| to show EA n — > ir 2 /6. The replica method 
and the somewhat related cavity method is extensively used in statistical physics 
in non-rigorous analysis of spin glass and related disordered systems p3], and 
there is considerable interest in mathematical formalization of the method (see 
e.g. Talagrand ||). 

(c) Conjectured exact formulas. Parisi [^] conjectured that in the case 
where the c(i,j) have exponential(l) distribution, there is the exact formula 

n 

EA n = Yj r2 - (!) 
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It is natural to seek an inductive proof. Coppersmith - Sorkin H and others 
J|, ||, 17] have recently formulated, and verified for small n, more general conjec- 
tures concerning exact formulas for related problems, e.g. involving incomplete 
matches on m x n bipartite graphs. But (Q) has not been proved rigorously (an 
argument of Dotsenko is incomplete). 

(d) Weak convergence. Aldous |2| proved that lim n EA n exists, by identi- 
fying the limit as the value of a minimum-cost matching problem on a certain 
random weighted infinite tree. 

1.2 Results 

In this paper we continue the "weak convergence" analysis mentioned above. By 
studying the infinite-tree matching problem, we obtain a rigorous proof of the 
Mezard - Parisi conclusion EA n — > tt 2 /6 (Theorem H), and also of their formula 
for the limit distribution of edge-costs in the optimal assignment (Theorem |2|) , 
and of a conjecture of Houdayer et al [12| concerning order-ranks of edges in the 



optimal assignment (Theorem |3|). Theorem |] introduces and proves an asymp- 
totic essential uniqueness (AEU) property of the random assignment problem: 
roughly, AEU asserts that every almost-optimal matching coincides with the 
optimal matching except on a small proportion of edges. Studying AEU is in- 
teresting for two reasons. First, it can be defined for many "optimization over 
random data" problems (section |6.1| ), providing a theoretical classification of 
such problems (AEU either holds or fails in each problem) somewhat in the 
spirit of computational complexity theory. Second, in the statistical physics of 
disordered systems it has been suggested [^] that the minima of the Hamil- 
tonian should typically have an ultrametric structure, suggesting that in the 
associated optimization problem the AEU property should fail. Theorem |3] 
thus shows that the random assignment problem (studied by physicists as a 
toy model of disordered systems) has qualitatively different behavior than that 
predicted for more realistic models. 

It follows easily from |^] that the distribution of the c(i,j) affects the limit 
of EA n only via the value of the density function of c(i,j) at (assuming this 



exists and is strictly positive: see section 5.1 for the power-law case), and so 
in particular we may assume that the distribution is exponential(l) instead of 
uniform. Indeed, to avoid later normalizations it is convenient to start out by 
rephrasing the problem as 

n 

A n = mini^c(i,vr(i)) 
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where the (c(i,j)) are independent with exponential distribution with mean n. 
Write 7r n for the permutation attaining the minimum. 

Theorem 1 lim n EA n = 7r 2 /6. 



Theorem 2 c(l,7r n (l)) converges in distribution; the limit distribution has 
density 



, . e x (e x - 1 + x) 
h ^ = (i- e -s)2 ' - x < 00 • 



Theorem 3 For eac/i > 1 define 

Qn{k) = P(c(l, 7r n (l)) zs i/ie /c 'i/t smallest of {c(l, 1), c(l, 2), . . . , c(l, n)}). 
JTien lim n g n (A;) = 2 _fc . 



Theorem 4 For eac/i < 5 < 1 f/iere exzste e(S) > suc/i i/iat, if fi n are 
permutations (depending on (c(i,j))) such that En~ l #{i : [A n (i) ^ vr n (i)} > 5, 
then 

lirninf E U c(i, //„(»)) J > vr 2 /6 + 

The logical structure of our proof is as follows. 

• Review from Q the infinite-tree minimum-weight matching problem and 
its connection with the finite-n problem (section |Q| ). 



Describe one matching M. Q pt in the infinite-tree setting (section |3.3| ). 

Calculate the cost of the matching 7V4 pt and give an expression for the 
extra cost of any other matching M (section |3? 



In principle our method should yield an explicit lower bound for e(5) in Theorem 



see section 4.4 



Matchings on the infinite tree are required to satisfy a certain spatial invari- 
ance property, and this is the only technically difficult ingredient of the paper. 
Granted the abstract structure, the actual calculations underlying the formulas 
in Theorems [l] - || are quite straightforward, and we set out these calculations 
first in section ||. In a sense the calculation of the limit in Theorem |l| can be 
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summarized in one sentence. Let X,Xi be i.i.d. with distribution determined 
by the identity 



X = min(^j — Xj) where is a Poisson(l) process; (2) 

i 

then 

limEAn = J xP{X\ + X 2 >x) dx. (3) 

From these it is straightforward (section to evaluate the limit as 7r 2 /6, though 
we do not have any calculation-free explanation of why such a simple number 
should appear here or in Theorem |3[ 

We remark that the technically most difficult, and conceptually most im- 
portant, part of the proofs of Theorems || - || is the result from j|] (restated as 
Theorem 11) connecting the finite-n assignment problem with the infinite-tree 
problem. The new additions of this paper are precisely the construction (sec- 



tion 3.3) of the matching -M pt on the infinite tree, and its analysis in sections 



3.4 and [Tj^, together with the elementary calculations in section |2[ 



1.3 Discussion 



This paper was of course motivated by a desire to make the Mezard - Parisi [21| 
result rigorous. In retrospect our method, based on weak convergence to a limit 
random structure, seems roughly similar to the non-rigorous cavity method from 



statistical physics (see [|26|, |29(] for brief descriptions), and Talagrand |31[] for 
its rigorous application in the Sherrington-Kirkpatrick model) though I do not 
understand that method well enough to judge the degree of similarity. So can 
our method be used for other mean-field models of disordered systems in which 
the cavity method has been used? It turns out it is easy to use our method 
to write down heuristic solutions to three variations of the random assignment 
problem: 

• The case where the density function / c (-) of the costs c(i,j) has f c (x) ~ x r 



as x I (section |5.1|) . 



The one-parameter Gibbs measure associated with the assignment prob- 
lem, recently studied rigorously by Talagrand [|30| (section |5.3| ). 



• The random combinatorial traveling salesman problem (section |5.2|) . 

We shall compare heuristic solutions from our method with those from the 
cavity method. Making a rigorous proof in these variations involves precisely 
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the issue in : one needs a rigorous argument identifying the optimal solution 
of an infinite-tree problem as the limit of optimal solutions of finite-n problems. 

It is worth noting that in our method, calculations of limit quantities de- 
pend on solving a problem-specific distributional identity (^, |38|, [0], ^). Such 
identities arise frequently in probabilistic analysis of recursive algorithms [27| 



and "probability on trees" [18], and seem worthy of further systematic study. 

Section ^ contains miscellaneous discussion; the AEU property in more 
general contexts, the (lack of) insight our results cast on the exact conjecture 
(|l|) and the intriguing analogy with Frieze's £(3) result. 



Our method rests upon a kind of "local convergence" (12) of 7r n to A4 op t, 
saying that the structure of ir n and edge-costs within a finite distance (inter- 
preting edge-costs as distances) of a typical vertex converges in distribution 
to the structure of M op t and edge-costs in the infinite tree. So in a sense it 
is Theorem ^ which is our basic result. We can deduce Theorem |] because 
EA n = Ec(l,ir m (l)), but we cannot study the variance of A n by our methods. 
On the other hand we could write down expressions for the limit behavior of 
arbitrarily complicated "finite-distance" variations on Theorems || and ||, e.g. 
for 

j(x, y, z) : = lim 

-E ( number of pairs (i, k) : c(i, vr(i)) < x, c(k, ir(k)) < y, c(i, n(k)) < z) . 

2 Calculations 

Consider the symmetric probability density 

f x ( x ) = (>/ 2 + e - x ' 2 ) " 2 , -oo < x < oo 



which statisticians call the logistic distribution. It is standard [14] that the 
corresponding distribution function and variance are 

F x (x):=[ f x {y)dy = (l + e- x y 1 , -oo < x < oo (4) 

/oo 
x 2 f x (x)dx = ir 2 /3 (5) 
-oo 

and that the logistic distribution is characterized by the property 

f(x) = F(x)(l- F(x)), -oo<x<oo. (6) 
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In the calculations below we seek to exploit symmetry and structure, rather 
than rely on "brute force calculus" . 

Lemma 5 Let < £1 < £2 < • • • be the points of a Poisson process of rate 1. 
Let (X;Xi,i > 1) be independent random variables with common distribution 
fi. Then 

min - Xi) = X (7) 

l<i<oo 

if and only if \i is the logistic distribution. 

Proof. The points form a Poisson point process V on (0, 00) x 

(—00,00) with mean intensity p(z, x)dzdx = dz fj,(dx). The distribution func- 
tion F of n satisfies 

1 - F(y) = P (min (& - X t ) > y) by (|) 

\l<i<oo / 



P (no points of V in {(z, x) : z — x < y}) 
exp ^— y y /j(z, x) dzdxj 

exp ^— y F(z — y) dz^j , where F(y) = 1 — F(y) 



/ roc _ \ 
= exp I — / F{u)du 

Differentiating, we see this is equivalent to 

F'{y)=F{-y)F{y). (8) 

This implies the density F'(-) is symmetric, and so (||) is equivalent to the 
condition ([]) characterizing the logistic distribution. □ 

In some subsequent calculations we use the general identity (for arbitrary 
real-valued r.v.'s V, W) 

/oo 
P(V > x > W) dx. (9) 
-00 

Lemmas ||, |7] and [9| are the "calculation" parts of Theorems [l], [2] and ^ respec- 
tively. 

Lemma 6 Let X\ and X2 be independent random variables with the logistic 
distribution. Then 

h(x) := P{Xi + X 2 > x), < x < 00 
is the density of a probability distribution on [0, 00) with mean 7r 2 /6. 
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Proof. To show it is a probability density, 



h(x)dx = E(Xi + X 2 ) + 

= E(X\ — X 2 ) + by symmetry 

/oo 
P{X 1 >y>X 2 ) dy by (| 
-oo 

(l-F(y))F(y) 
/(y) by (§) 



oo 



-oo 
oo 



And the mean is 



= 1. 

xh(x)dx = / xP(Xi + X 2 > x) dx 
Jo 

= \E{{X 1+ X 2 ) + f 

= jE(Xi + X 2 ) 2 by symmetry 

= \EXl 

= 7r 2 /6 using (|5|). 



□ 



The next lemma derives an explicit formula for h(x), though it is not needed 
except to state the conclusion of Theorem ^ (my thanks to Boris Pittel for this 
calculation) . 



Lemma 7 



h(x) 



e x (e x — 1 + x) 



(1-e 



-x\2 



Proof. 



h(x) 



(g«/ 2 _|_ g— 1 _|_ qX—u 

du 



< X < OO. 



du 



(10) 



-oo 

CO 



(e u + l) 2 e x + e u 
t 

(t + l) 2 (t + e x ) f 
e x ( 1 



{e x -l) 2 \t + l t + e x J e x -l(t + l) 



dt 
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e x f t + 1 
(ex_i)2 [ 0g t + e* 
xe x 1 



+ 



1 t + 1 



(e x - l) 2 e x - 1 

and this equals the formula at (|H]). □ 

Mezard - Parisi (JT^j eq. (33)) write down the formula 

~ x — e~ x sinhx 

Kx) = — T2 

sinn x 

as the limit density for edge-costs in the optimal assignment, in the essen- 
tially equivalent non-bipartite matching problem on 2n vertices. One can check 
h(x) = 2h(2x). The factor of 2 merely reflects the different normalization con- 
vention (dividing by 2n instead of n). 

Lemma 8 Let (X,Xi,X2,rj) be independent, the X 's having logistic distribu- 
tion and n having exponential(l ) distribution. Then 

X = mm(X 1 ,X 2 )+r ] . (11) 
Proof. In the setting of Lemma ||, 

••) = (V,ri + Z'x,v + &,■■■) 

where the (£0 are a Poisson (rate 1) process independent of n. So identity (0) 
implies 

X = 7] + min(-Xi, min(£- - X i+1 )). 



i>l 



But by (§ minimi ~ x i+i) = X 2, so 

X = n + mm(-X 1 ,X 2 ) 

and the result follows from the symmetry of the logistic distribution. □ 

Note that ( pi] ) does not characterize the logistic distribution, because X+ 
constant will also satisfy (|Tl|). But it is not hard to show (Antar Bandyopad- 
hyay, personal communication) these are the only solutions. 

Lemma 9 In the setting of Lemma for each k > 1 

/ P ( x - X < min - Xi) and x > ( k ) dx = 2~ k . 

JO V l<i<oo / 
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Proof. Using symmetry of the logistic distribution, we may rewrite the inte- 
grand as P(£k < x < mini<j <00 (£j + Xi) — X). Then by @ the value (Q(k), 
say) of the integral is 



Q(k)=E min fa + X<) - fa + X) 

\l<i<oo 

Write 

(6,6,-- •) = (6 - Vk-i,€k - Vk-2, ■ ■ ■ ,6t - vii€k,fik + £1,6 + ^2> • • ■)> 

so that is a Poisson process. Then 

Q(fc) = £ I min(Xi - %_i, . . . ,X fc _i - r/i,X fc , min (£■ + X k+i )) -X 

\ l<i<oo 

= E(mm(X 1 -ri k - 1 ,...,X k _ 1 -ri 1 ,X k ,X k+1 ) - X) + 
by ([?]) and symmetry 

/oo 
P(X < x < min(Xi - r) k -i, ■ ■ -,X k -\ - rji,X k ,X k+1 ))dx by (|) 
-oo 

/oo 
F(x)P(x < min(Xi - 771, X fc )) (1 - F(x))dx 

-00 

/oo 
f(x)P(x < minpfi - 77^—1, . . .,X k -i - m, x k)) dx by © 
-00 

= P(X < min(Xi - r? fc _i 5 . . . , X k -% - rjx,X k )). 

Since (771, . . . %_].) are distributed as the first fe — 1 points of a Poisson process, 
we can relabel variables to obtain 

Q{k) = P(X < mm(X ,X 1 - ^,X 2 - . . . - 

For k = 1 this says Q(l) = P(X < Xq) = 1/2, so it is enough to show 
Q(k) = \Q{k - 1) for k>2. Write M = mm(X,X ). Then P(X = M) = 1/2 
and this event is independent of the value of M, so 

Q(k) = \P{M < mm(X 1 - £ h X 2 , X k _ x - 

Writing 

(6,6,^3, •••) = (v,v + &,v + &,■■■) 

and setting X- = X i+ i gives 

Q(k) = |P(M + n < mm(X' ,X[ X' k _ 2 - &_ 2 )). 

But Lemma |8| shows M + rj = X and so we have proved Q(k) = \Q{k — 1). 
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3 Working on the infinite tree 



3.1 The infinite matching problem 

In this section we review results from Aldous ||], which unfortunately require 
considerable space to describe. 

Let V be the set of finite words v = v±V2 ■ ■ ■ Vd, < d < oo where each 
Vi is a natural number; include in V the empty word (p. There is a natural 
tree T with vertex-set V and edge-set E, where an edge e G E is of the form 
e = (v, vj),j > 1, where for v = v\V2 ■ ■ ■ va we write vj = v\V2 ■ ■ ■ Vdj for the 
j'th child of v , and call v the parent of vj. Now attach random edge- weights 
as follows. For each v G V let the weights (W(v, vj), j > 1) on the edges 
((v,vj),j > 1) be the points of a Poisson (rate 1) point process on (0,oo), 
independently as v varies. Call this structure the Poisson-weighted infinite tree 
(PWIT). See figure 1 for an illustration. Write A for the probability distribution 
of the whole configuration (W (e)) of edge-weights. So A is a probability measure 
on the space W = (0, oo) E of all possible configurations w = (w(e), e E E) of 
edge- weights. 




A matching m on the PWIT is a set of edges of T such that each vertex is 
incident to exactly one edge in the set. Formally we can identify the set M 
of matchings as the subset M C {0, 1} E defined by: m = (m(e)) € M iff 
J2e-.v£e m(&) = 1 Vi) £ V. A matching can also be regarded as a map from 
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vertices to vertices, in which case we write m (v) = v' to indicate (v, v') is an 
edge in the matching. See figure 2 for an illustration. 
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Figure 2. Part of a matching m on the PWIT. Only the edges of m are 
drawn. 



We study random matchings M on the PWIT. Such a random matching will be 
dependent on the edge-weights, so formally we are dealing with a probability 
measure \x on W x M (the joint distribution of edge- weights and indicators of 
edges in the matching) with marginal distribution A on W. 

The connection between matchings M. on the PWIT and matchings 7r n in 
the n x n random assignment problem is via local convergence of matchings 
on T induced by the unfolding map; these ideas are explained at the end of 
this section. Theorem |ll] below says (roughly speaking) that lim n EA n is the 
average cost per edge in a minimum-cost matching on T. At first sight such a 
result looks false, because there is £l greedy matching J\A. greedy 

consisting of edges 

(<M), (2,21), (3,31),... ,(11,111), (12,121), , and 7W greedy has average 

cost equal to 1, which is certainly not the desired \\m n EA n . But the precise 



result is more complicated, because matchings in Theorem 11 are required to 
satisfy a certain property: informally 



12 



(*) The rule for whether an edge e is in the matching should be 
spatially invariant, that is should not depend on which vertex was 
chosen as the root of T. 

We now work towards making this notion precise. Our definitions are superfi- 
cially different from those in ||] , but we reconcile them in section [4.5| , 

For each w € W and each i > 1 we define an isomorphism of T (that is, a 
bijection V — > V which induces a bijection E — > E; we denote either bijection 
by 0y(-)) which expresses the idea "make vertex i the root, and relabel vertices 
to preserve order structure". Precisely, first define k G {1, 2, 3, . . .} by 

w(i,i(k — 1)) < w(4>,i) < w(i,ik) 

interpreting the left as for k = 1. Then define 
Of {t) = 

ef{4>) = k 

Of takes vertices (il, i2, . . . , i(k — 1); ik, i(k+l), . . .) to vertices (1, 2,...,k — 
1; k + 1, k + 2, . . .) 

for v = ij 2 ...ji and I > 3 let 6?(v) = {6f(ij 2 ))h . . . ji 
for v = jij 2 -.-ji where ji / i let 6™(v) = kjij 2 ...ji- 

As already mentioned, the space WxM describes possible combinations (w, m) 
of edge-weights w and a matching m. Consider the enlarged state space Z = 
W x M x {1,2,3,...} with elements (w,m, k) where we interpret the k as 
meaning that vertex k is distinguished. The maps Of induce a single map 
9 : Z — > Z which we interpret as "relabel the distinguished vertex as the root, 
relabel other vertices to preserve order structure, and make the previous root 
into the distinguished vertex". Precisely, 0(w, m, i) = (w,m, k) where 

w(e) = w{6?{e)) 

m(e) = rh(9™(e)) 

k = er (<!>). 

See figure 3. Note 9 = -1 . 

A probability measure fi on W x M (describing a random matching on the 
PWIT) extends to a a-finite measure //* := fj, x count on Z, where count is 
counting measure on {1,2,3,...}. 

Definition 10 A random matching Ai on the PWIT, with distribution /x (say) 
on W x M, is spatially invariant if /i* is invariant under 9, that is if = 
(j,(0 (•)) as a-finite measures on Z. 
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Figure 3. Take the edge-weights w in figure 1, and the matching m in 
figure 2, and distinguish vertex 2. Then figure 3 illustrates part of #(w, m, 2) = 
(w, rh, 1). Figure 3 shows the new edge- weights w attached to the edges; edges 
in the new matching rh are indicated by *; the new distinguished vertex is 
vertex 1. 



In a random matching Ai the root cj) is matched to some random neighbor 
vertex J\A (4>) € {1,2,3,...}, and the edge (4>, M. {cj))) has some random cost 
W((f), A4 (4>))- We can now reformulate the main result of |2) as 
Theorem 11 

lim EA n = mi{EW(6, M U))} 

n 

where the infimum is over all spatially invariant random matchings A4 on the 
PWIT. 

Though Theorem |ll] is the only result from [Q] needed to prove Theorem |l], the 
other new theorems need further infrastructure from [@|. The reader should 
perhaps skip the rest of this section on first reading. 
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A random matching in the n x n random assignment problem (in brief, an 
n-matching) was denoted by ir n in the introduction, but can be reformulated 
as a n x n {0, l}-valued random matrix Ai = (m(i,j)). Call an n-matching 
spatially invariant if the joint distribution ((c(i, j),m(i, < i,j < n) is 

invariant under the automorphisms of the complete bipartite graph, that is 
under permutations of i, under permutations of j, and under complete inter- 
change of i and j. Given any random n-matching, by applying a uniform 
random automorphism one obtains a spatially invariant n-matching with the 
same distribution of average-cost-per-edge A n , so there is no loss of generality 
in considering only spatially invariant n-matchings. There is a notion of "local 
convergence" Ai n -^f oc& \ Ai, made precise at (^) later, which implies in par- 
ticular that c(l, Ai n (1)) W(4>,M ((p)). The following two results (stated 
slightly differently in [0] - see section |4.5[) immediately imply Theorem [ll]. 

Theorem 12 Let Ai be a spatially invariant matching on the PWIT with 

EW((j), Ai (4>)) < oo. Then there exist spatially invariant n-matchings Ai n 

such that Ai n — >f oca i Ai and (c(l,Ai n (1)), 1 < n < oo) is uniformly inter- 
grable. 



Theorem 13 Let j n f oo ; and let Aij n be spatially invariant j n -matchings 

such that limsup n Ec(l, Mj n (1)) < oo. Then there exists a subsequence k n of 
j n such that M-kn ~ >focai where Ai is some spatially invariant matching 
on the PWLT. 



Remarks. The proof of Theorem |13| is fairly simple, using compactness argu- 
ments. The proof of Theorem [l^ in Q is difficult and lengthy. In light of 
the new results of this paper, we only need Theorem [l^ for Ai = -M op t, and 
perhaps the explicit structure of At opt could be used to simplify the proof of 



Theorem 12 



To formalize local convergence requires further notation. Fix n and let 
V"( n ) be the set of vertices v = V\Vi . . . vi with v\ < n and v% < n — 1 for 
i > 2. Write T( n ) = (V( n ),E( n )) for the corresponding subtree of T. Let 
G nn be the complete bipartite graph, with vertex-set {1,2, ...,n} x {1,2}. 
Given a realization c = (c(i,j)) of the cost matrix, one can define a graph 
homomorphism ip = ip c from T( n ) onto G nn as follows. 

V#) = (M) 

for i 6 {1, 2, 3, ... , n}, define tp(i) = (j, 2), for the j such that c(l,j) is the 
i'th smallest of {c(l, u), 1 < u < n} 
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for i £ {1, 2, 3, . . . , n — 1} and i! € {1,2,..., n}, define = (k, 1), for 

the k such that c(/c, 1) is the i'th smallest of {c(u, 4>(i')), 1 < -u < n, u / 1} 

and then inductively: for v = v\V2 ■ ■ ■ V2 m , for i G {1, 2, 3, . . . , n — 1}, 
define ^(-ui) = (j, 2), for the j such that c(ip(v),j) is the i'th smallest of 
{c(^(i>),u), 1 < u < n, (u, 2) ^ ip(v~)} where v~ is the parent of v 

for v = v\V2 ■ ■ ■ V2m+i, for i E {1, 2, 3, . . . , n — 1}, define = (A;, 1), for 

the A; such that c(A;, Y>(f )) is the i'th smallest of {c(u, ip(v)), 1 < u < n, (u, 1) ^ 

This folding map V'c induces an unfolding map which uses the matrix c to 
define edge-weights (W^ n \e)) on the edge-set E^ ra ): 

(v, vk) = c(i,j) if Mv) = (*,!) and ^ c (wA;) = (j, 2) 

or if Vc(^) = (j, 2) and ip c (vk) = (i, 1). 

Now for /i > 1 define E^ C E as the set of edges each of whose end- vertices 
is of the form v\V2 ■ ■ ■ vi with / < h and maxj Vi < h. So E^) is a finite set of 
edges. It is straightforward to see that for fixed h 

(wW(e),e€E w ) 4 (W(e),e € E (h) ) 

where the limits are the edge- weights in the PWIT. (Essentially, this is the fact 
that the order statistics of n independent exponential (mean n) r.v.'s converge 
to the points of a Poisson (rate 1) process.) Recall that we represent a random 
n-matching as {0, 1}- valued random variables A4 n (e) indexed by the edges e of 
G nn . Use the homomorphism tp (considered as a map on edges) to define 

M n {e) = M n ^(e)), e e EH 

We can now define local convergence A4 n ^f oca i M. to mean: for each fixed h, 

((wW(e),M n (e)), e€E w ) 4 ((W(e), M(e)), e € E (h) ) . (12) 
3.2 Heuristics for the construction of M op t 

Underlying the rigorous construction in the next section is a simple heuristic 
idea. Given a realization of the PWIT, consider defining 

= cost of optimal matching on T 

— cost of optimal matching on T \ {<fi}. (13) 
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Here we mean total cost, so we get oo — oo, and so (|T^) makes no sense as a 
rigorous definition, though statistical physics uses such renormalization argu- 
ments all the time. But pretend ( |T3| ) does make sense. Then for each v G V 
we can define X v similarly in terms of the subtree T" rooted at v (the vertices 
of T" are v and its descendants): 

X v = cost of optimal matching on T v 

— cost of optimal matching on T v \ {0}. 

And we get the recursion 

X v = min (W(v,vj) - X vj ) (14) 

1<j<oo 

because the left side is the cost difference between using or not using v in a 
matching on T^; to use an edge (v, vj) we have to pay the cost of the edge and 
the difference between the cost of not using or using vj, which is the right side. 
Moreover in the optimal matching on T v , vertex v should be matched to the 
vertex vj attaining the minimum in fll4j). 

In the next section we show that one can make a rigorous argument by first 
constructing (by fiat) random variables satisfying the recursion (0), then using 
these random variables to define a matching. Not having interpretation (p^) 
means it's not obvious rigorously that this matching is optimal, but it turns 



out (see start of proof of Proposition 18) that weak optimality is quite easy to 
prove. 

3.3 The construction 

Each edge e E E of T corresponds to two directed edges e , e : write E for 
the set of directed edges. For directed edges we have the language of fam- 
ily relationships: each edge e = (v',v) has an infinite number of children 
of the form (v,y), y ^ v'). Thus the directed edge (273,27) has children 
(27,2), (27,271), (27,272), (27,274),.... The PWIT has edge- weights W(e) 
on undirected edges, and we write W(e) = W(e) = W(e) for the directed 
edges e , e corresponding to e. 

Lemma 14 Jointly with the edge-weights (W(e),e G E) of the PWIT we can 
construct {X(e), e GE} such that 

(i) each X{e) has the logistic distribution 

(ii) for each e , with children e~i, e2, ■ ■ ■ say, 

X(7)= min (W^-X^)) (15) 

1<j<oo 
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Proof. For a vertex v = ■ ■ - iu write \v\ = h. For h > 1 write 

Eft = {e"= : |u| =h,j> 1} 

E<ft = {"e=(u,y): |v| < h, \y\ < h}. 

Take independent logistic random variables {X(e) :esEh}, independent of 



the family (W(e)). Then use (15) recursively to define X(e) for each eGEa. 
Lemma [| and the natural independence structure ensures that each X{e) 
has logistic distribution. The construction specifies a joint distribution for 
{W(e),e £ E; X(e), e£Eft U E</i}. The Kolmogorov consistency theorem 
completes the proof. □ 

Remarks, (a) By modifying on a null set, we shall assume that the minimum 
in (|l5|) is attained by a unique j. 

(b) For an edge e = (v',v) directed away from <fr, the X(e) constructed above 
formalizes the notion of X v in the previous section. 

(c) The same type of construction can be associated with general fixed-point 
identities for distributions which have an appropriate format. See e.g. Aldous 
U for its use in studying a model of "frozen percolation" on the infinite binary 
tree. 

The corollary below (whose routine proof we omit) formalizes the indepen- 
dence structure implicit in the construction. For a directed edge e , consider 
the set consisting of all its descendant edges, but not e itself; then write D{ e ) 
for this set of edges, considered as undirected edges. 

Corollary 15 For each i > 1 let /i,2, • • ■ € E and let e^i, e^, • • • €E. 
Write Di = DjD(eij) U /» 2> • • •}• If the sets Di are disjoint as i varies 
then the a-fields a(X(eij),W(fij), j > 1) are independent as i varies. 

Remark. It is natural to ask whether X(e) is <r(W(e),e 6 D( e ))-measurable, 



in other words whether the influence of the "boundary" E^ in the Lemma 14 
construction vanishes in the h — > oo limit. In the terminology of statistical 
physics, this asks about extremality of free boundary Gibbs states. We have 
not studied this question carefully, but there are reasons to suspect the answer 
is negative. 

For v, v' G V write v ~ v' if (v, v') is an undirected edge. Fix a realization 
of (W(e),e <G E; 1(e), eeE). For each v G V define 

v* = arg min(W(v, v') — X(v,v')) (16) 
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that is, v* is the vertex attaining the minimum. 

Lemma 16 The set of undirected edges {(v,v*) : v £ V} is a matching on 
the PWIT. 



Proof. It suffices to check that (v*)* = v for each v 6 V. Fix v. Then 

min (W(v,y) 
X(v*,v) by ©. 



W(v,v*) — X(v,v*) < min (W(u, y) — X(v, y)) by definition of v* 



In other words 

X(v,v*) +X{v*,v) > W(v,v*). (17) 
Suppose (v*)* = z ^ v; then 

W{v*,z) - X(v*,z) < W{v*,v) - X(v*,v). 

But by (II) 



X(v,v*) < W(v*,z) - X(v*,z). 

Adding these last two inequalities gives X(y,v*) < W(v*,v) — X(v*,v) which 
contradicts (|l~7|) and establishes the lemma. □ 

Write A'lopt for the random matching specified by Lemma 16. Note that the 
argument leading to ( |l7|) can be reversed, to give a more symmetric criterion 
for whether an edge is in A4 opt - 

e is an edge of M opt iff W(e) < X{e) + X(e) (18) 

where e , e are the directed edges corresponding to e. It seems intuitively clear 
that -A/[ op t should be spatially invariant; we verify this in section |4~l] as Lemma 
24| 



Remarks, (d) It is not obvious whether A4 Q pt is a function of the weights 
(W(e),e € E) only, or involves additional external randomization. In fact by 
( |l8| ) this question is equivalent to the question in remark (c) above. 

(e) Intuitively, the quantities X{e) play a role analogous to dual variables 
in linear programming. But we are unable to make this idea more precise. 

3.4 Analysis of .M op t 

Consider the random cost W((p, Ai op t((p)) of the edge ((f), M. op t(4>)) containing 
the root <f> in -M pt- 
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Proposition 17 (a) The random variable W(4>, -M pt(<A)) has the probability 
density function h(-) described in Lemma^, and so EW((t>,J\4 op t((f>)) = tt 2 /6. 
(b) 

P(M opt (<t>) = k) = 2~\ k > 1. 

Proof. The edge-weights (W(cft,i),i > 1) and the X-values (X((j),i),i > 1) are 
distributed as the Poisson process and the i.i.d. logistics (Xi) in Lemma 
[|. Using the latter notation and the definition ([[(]) of M pt, 

W((j),M op t(^)) = £/, where I = argmin(^ - Xi). 

1>1 

As a sophisticated way to obtain this distribution (there are alternate, elemen- 
tary ways) fix < y < oo and condition on the event A y := {3J : £j = y}. 
Conditionally, the other points ^ J) and other X-values (X; L ,i ^ j) are 
distributed as a Poisson process > 1) say and i.i.d. logistics (Xj,j > 1) 

say, independent of Xj, whose conditional distribution remains logistic. So 

P(I = J\A y ) = P{y -Xj < X') where X' = mm(^ - X-). 

But by Lemma || X' has logistic distribution; since it is independent of Xj, we 
see from the definition of h(-) that P(I = J\A y ) = h(y). Then 

P(Zi e [y, y + dy\) = P(I = J\A y )P{ some & in [y, y + dy\) = h{y)dy 

establishing part (a). For part (b) we need to show 

P(I> Jfe + l) = 2~ k , k>l. (19) 

From the argument above 

P(I = J,I>k + l\A y ) = P{y -Xj< mm(£ - Xj), & < y) = h k {y), say . 

So 

P(Zl e[y,y + dy],I>k + l) = h k (y)dy 

and then 

roc 

P(I>k+l)= / h k (y)dy 
Jo 

and the integral is evaluated by Lemma P. 
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Proposition 18 Let M. be a spatially invariant matching of the PWIT such 
that P(M (<P) + M op t{(f>)) > 0. Then EW{4>,M (</>)) > EW{<t>,M ov t{4>))- 

Before embarking upon the proof of Proposition [18L let us show how Theorems 
H| - |D are deduced from Propositions 17 and 15. Indeed, Propositions 17 and 18 
imply 

ml{EW{(j), KA ((/))) : Ad is spatially invariant matching on the PWIT} = ^- 

and so Theorem |ll| implies Theorem [l]. We next claim that the optimal assign- 
ments in the n-matching problem, considered as random matchings A4 n as in 
section 3J, satisfy 

■M n — ^focai -^opt and (c(l, A4 n (1)), 1 < n < oo) is uniformly intergrable. 

(20) 

If not, then by Theorem |l3| some subsequence converges locally to some KA ^ 

M op t, so limsup n EA n > EW(4>, M (<fi)) > vr 2 /6, the last inequality by Propo- 
sition 18, but this contradicts Theorem |l|. Now ( f20|) implies Theorems ^ and 
||, because the definition of local convergence ( |T2| ) implies 

M n (1), c(l, 1), c(l, 2), . . .) 4 to op t(0), W(</>, 1), W(</>, 2), . . .) 

and then Proposition 17 identifies the required limit distributions. To prove 
Theorem ||, fix 5 > 0. Suppose the assertion were false for that 5; then we can 
find random j n -matchings A4j n (for some j n — > oo) such that 

P (jU jn (1) +M ]n (1)) > S (21) 

where as above A4j n is the optimal j n -matching; and 

Ec(l,M jn (1)) ^vr 2 /6. (22) 



But by Theorem 13 we can, after passing to a further subsequence, assume 

Ma 

^locai -M-i for some spatially invariant random matching KA. Then 
(0) implies P(M ((/)) + A4 op t(^)) > 5, while (J§) implies (0)) < 

EW((j), KA opt (</>))> contradicting Proposition |l~8|. This "proof by contradiction" 
establishes Theorem ||. 

Examining the argument above, we see that a weak inequality in Proposition 
18 would be sufficient to prove Theorem ffl, while the strict inequality is needed 



for our proofs of Theorems 
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3.5 Proof of Proposition 18. 

Let A4 be a spatially invariant matching, and write A for the event {M (</>) ^ 
•A^opt (</>)}• On A write 

(«_i,u ,«i) = (>fopt(0),0,A^ (0)). 

Then on ^4 we can define a doubly-infinite path . . . , u_2, uoi ^1)^2, ■ • ■ by: 
V — oo < m < oo 

(«2m-l,V2m) is an edge of M opt 
(v2m,V2m+i) is an edge of X. 



In section 4.1 we shall use spatial invariance to prove 

Lemma 19 Conditional on A, the distributions of X(v-2,V-i) and X(vq,vi) 
are the same. 

Now 

X(v^ 2 ,v-i) = min (W(v-i, y) - X{v- lt y)) by (15) 

= (W(«-i,wb)-X(t;_i,Ud)) (23) 
by (|i"6|), because (v_i,vo) is in -M pt- Also, by ( p~5| ) 

«d) <WK,Vl)-X(« 0)Ul ) (24) 

so that 

£> := W(wo, ui) - X(v , m) - t*,) > 0. 

Combining ([23] ) with this definition of Z) gives 

W(v 0l V!) - W(v ,v-i) = D + X(vq, v{) - X(v- 2 ,v-i). 

So 

EW(<f>,M (M-EWfaMaptty)) 
= EiWivQ^-Wiv^v-^U 
= ED1 A + EX(v ,v^)l A - EX{v- 2 ,v-x)1a 
= ED1 A by Lemma 19. (25) 
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Since D > this is enough to establish the weak inequality EW((f>,M ((f))) > 
EW(<p,M pt((j))). To establish the strict inequality, suppose that to the con- 
trary EW((j),M ((f))) = EW((f),M op i((f))). Then (||) implies EDl A = 0, and 
then (p4|) implies that on A we have X(v-i,(f>) = W((f), v\) — X((f),v\). And 
this implies that on A we have 

v\ = argmin ^(W(<p, i) — X(<fi, i)) 

i 

where min^l denotes the second-smallest value, because M. ((f)) = V—% = 
argminj(W((/>, i) — X((f>,i)). So without restricting to A we have 

p(M ((f)) = aigmm(W((f),i) -X((f),i)) or arg min [2] (W((f), i) -X((f),i))\ = 1. 

By an immediate use of spatial invariance we must have the same property at 
every v £ V: 

P (v) = argmm(W(v, i) — X(v, i)) or argmin^(W(v,i) — X(v,i))^j = 1. 

(26) 

So it's enough to show this can't happen. 

Proposition 20 The only spatially invariant random matching on the PWIT 
satisfying is -M op t- 

Remark. One can regard Proposition as a kind of "subcritical percolation" 
fact. There is some set .M2-opt of edges (v' , v") such that 

v" = axgmm^(W(v',v) —X(v',v)) and v' = arg min ^(W(v",v) - X(v",v)). 

In contrast to -M op t, the first equality here does not imply the second. One can 
show that Proposition ^ is equivalent to the assertion that there is no infinite 
path consisting of alternating edges from A4 op t and Al2-opt- 

Proof. Define a path <p = wo, i0— i, W— 2, W— 3, . . . inductively by: for m = 
1,2,... 

w-2m+i = arg min (W(w- 2m +2, v) ~ X(w- 2m +2, y)) 
w-2m = arg min [21 (W(w-2m+i, y) - X(w- 2m +i, y))- 

1/~M>-2m+l 
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Then 



(w-2m+i, W-2m,+2) is an edge of M opt , each m = 1, 2, 3, 



Suppose were a spatially invariant matching satisfying ( pq ) such that A := 
{>! (</>) / -M op t(<A)} has P(A) > 0. Then on A we have 

(w_2 m , w_2-m+i) is an edge of Ai, each m = 1, 2, 3, . . . . 

(The construction of (w m ) resembles the previous construction of (%), but 
note that the definitions of (w m ) above and of B m , B q , B* below involve only 
the PWIT and the (X{e)), not any hypothetical matching M.) So by fl26|) for 

V = W-2m 



[2] 

and so 



AC B m := {w_ 2m +i = arg min 1 1 (W(w- 2m , y) ~ X(w-2 m ,y))} 



A C B := tlm=iB m . 
Writing B q := r\ q m=1 B m we have 

if P(A) > then lkn P(B q+1 )/P(B q ) = 1. (27) 

In section [4.1| we shall use spatial invariance to prove 
Lemma 21 P(B q+1 ) = P(B q n B*), where 



„, s [2] 

w/iere 



£* := {0 = arg min L '(Wfwj, y) — X(iui, y))} 



:= argmin [2] (Ty(0,y) - X((j>,y)). 

y~<p 



Using Lemma 



to, = h M;f) = P(Br\ B') = 

<-~ P(B,) a P(B,) P(B) 



So by (p7|), to prove Proposition ^ and hence Proposition 18 it is enough to 
prove 

Lemma 22 If P(B) > then P(B*\B) < 1. 
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Remark. Intuitively, Lemma [2^ seems clear for the following reason. Event B 
depends only on what happens on the branch from <f> through W-i, while B* 
depends only on what happens on the branch through w\ . Even though there is 
dependence between the branches, the dependence shouldn't be strong enough 
to make B* conditionally certain to happen. Formalizing this idea requires 
study of the effect of conditioning the PWIT, and we defer completing the 
proof until section 4.2, though the next lemma is one ingredient of the proof. 

Lemma 23 Define 

X 1 = mm(W(<f>,i) - X 

i>l 

/ = argmin(PF(0, i) — X{(f>, i)). 

2>1 



For — oo < b < a < oo define 



X 



g(a,b) = P (W{(j),I) - b> min ^{W{I, Ik) - X(I, Ik)) 

V k>l 

Then g(a, b) > 0. 

Proof. Note that ([H]) shows 

X(<f>, I) = mm(W{I, Ik) - X(I, Ik)). 

k>l 

Conditionally on X(<f>, I) = x, the other values of {W(I, Ik) — X(I, Ik), k > 1} 
are the points of a certain inhomogeneous Poisson process on (x, oo), and so 

p(mmW(W(I,Ik) -X(I,Ik)) € [y,y + dy] 
V fe>l 



x{<t>,i) = x\ =p x (y)dy (28) 



for a certain function /3 X (') such that (3 x {y) > for all y > x. The quantities in 
( f28[) are independent of W(4>, I) and so ([2g|) remains true if we also condition 
on W((p, I) = a + x. So 



g(a,b,x) r- 



P [ W((f>, I)-b> min [2] (W(I, Ik) - X(I, Ik)) 



X(cf>,I)=x, W(cf>,I) = a + 



satisfies g(a,b,x) > for all — oo < b < a < oo and — oo < x < oo. Since 
= W(i, I) - X((f>, I) we see 

g(a,b) =E(g{a,b,X{<PJ))\X^ = a) >0 

as required. 
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4 Some technical details 



4.1 Spatial invariance proofs 

The definition (Definition [l^) of spatial invariance for a random matching A4 
on the PWIT involves the (probability) distribution fi of ((M(e), W(e)), e G E) 
and the a-finite measure fi* := fi x count on Z. Intuitively, if we use an arbi- 
trary rule to distinguish some vertex k (the rule depending on the realization 
of Ad and the W(e)), then the /^-distribution of the resulting configuration 
on Z is just the //-distribution of the configuration without the vertex being 
distinguished. 

To see the point of spatial invariance, let us show that the greedy matching 



M greedy from section p.l is not spatially invariant. 




Figure 4. Part of a realization of w € B* and of 9 x (w) E B. 

Consider the event B := {W(l, 11) < W(<j),l) < W(l,12)} which has proba- 
bility 1/4. So B* :=Bn{l is distinguished } has //*-measure 1/4. The inverse 
image B = 9^ 1 (B*) is (see figure 4) 

B = {W((f>,2) < W(2,21), 2 is distinguished} 

and this has //-measure 1/4 because P(W(0,2) < W(2,21)) = 1/4. (This 
equality of /t*-measures is a consequence of the fact that the distribution of 
edge- weights is spatially invariant.) Now for a random matching M. to be 
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spatially invariant, the event 

B* n {M (0) = 1} 

must have the same /x*-measure as its inverse image under 8, which is the event 

BC\{M (</>) = 2}. 

But for the greedy matching we always have A4 (</>) = 1, so the former event 
has fj,* -measure 1/4 while the latter event has //""-measure 0. 



Proof of Lemma 1L In words, the idea is that the distribution of X(vq, V\) 
as seen from <f> = vq is the same (by spatial invariance) as the distribution 
of X(vq,vi) as seen from uj, which (just by relabeling) is the distribution of 
X(«_2,«-i) as seen from vq = 4>. Precisely: 

P(A,X{v^ Vl ) s-) = p(m (0) + M opb (d>),X(<l>,M (</>)) e 



fj,* \M (4>) + M op t(<j>),X((f), M (</>)) e-,M {4>) is distinguished^ 
M * (m (0) ^ M opt (<j)),X(M {<j)),4>) e-,M (</>) is distinguished) 



using spatial invariance to switch the root from <f> to Ai (4>) , and noting that the 
event {M (</>) / .M op t(^)} is identical to the event {M (M (0)) / -M pt(-M 

(*))}; 



= fj,* \ M (<P) + M opt ((l>),X(M (</>), <j>) € ;M opt ((t>) is distinguished 
because /i*-measure doesn't depend on the rule for distinguishing a vertex; 
= // (^A4 (A^opt(^)) # M,pt CM op t (<£)), (A1opt(0)),^opt(^)) G -,>( O pt(0) is distinguished 

using spatial invariance to switch the root from <p to A4 pt (4>)'> 
= H*[M{4>)^ M opt (<t>),X(M (M opt (4>)),M opt {<t>)) e -,.M opt (</>) is distinguished 
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because the events are identical; 



= P[M (0) + M opt {(f>),X(M (M opt ((f>)),M opt {(f>)) G -J 

= P(A,Jf(«_ 2 ,V-l)€-) 

because v-\ = A4 op t((p) and t>_2 =A4 (v-\). 

Proof of Lemma This is essentially the same argument as above. In 
words, the idea is that the probability of B q n B* as seen from cj) = wq is 
the same (by spatial invariance) as the probability of B q n B* as seen from 
W2 = &rgmm w ^ Wl (W(wi,w) — X(wi,w)), which (just by relabeling) is the 
probability of B q+ \ as seen from wq = 4>. 

We leave details to the reader. 



Lemma 24 -M pt is spatially invariant. 

Proof. M op t is determined by the W(e) and the X{ e ), and the X{ e ) satisfy the 
deterministic relation (ff5t) which is unaffected by relabeling vertices. Moreover 



the joint distribution of the X(e) is determined by the fact (Lemma 14) that 

the r.v.'s (X(e), eGE/i) are independent logistic. In words, we need to show 

that the property "the (X(e), eSEfe) are independent logistic" is preserved 
under 9. 

Fix k and write 

B = {(I(e),eGEi,) € C, k is distinguished} 

for arbitrary C in the appropriate range space. Write 

Ai = {W(l, k — l)< W(ct), I) < W(l, k)}. 

Then (see figure 5) 

6~ 1 {B) = Ui A[ n {I is distinguished } n {(X(e ), ~e eBh,i) G C} 

where ~Eh,i is the set of edges (v, vj) with j > 1 and with v = j\j2 ■ ■ ■ jh-i 
for ji ^ /, or with v = lj\j2 ■ ■ ■ jh- Since J2i P{A{) = 1 and the X{e) under 
consideration are independent of Ai, verifying fJ.*{B) = fi*(9~ 1 (B)) reduces to 

showing that {X(e), eGE^,;} are independent logistic. But this follows from 
the construction (Lemma [l4] and Corollary [If 
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hh ■■■jh 




khh ■ ■■jh-i 




hji ■■■jh 




jlj2 ■ --jh-l 




Eft 






Eh,i 





Figure 5. 



4.2 The bi-infinite tree 

Recall A denotes the distribution of the edge- weights (W(e),e € E) of the 
PWIT. So A is a probability measure on W = (0, oo) E . To incorporate a 
distinguished neighbor of (j) we extend to the state space W x {1, 2, 3, . . .} and 
introduce the a- finite measure A x count. We now describe an equivalent way 
of representing this structure, which turns out to be convenient for certain 
calculations with M op t- 

Take two copies, T + and T say, of the PWIT, and write their vertices 
as +v and —v. Then construct a new "bi-infinite" tree by joining the 
roots +4> an d —<fi of T + and T~ via a distinguished edge Write 
E^ for its edge-set. Let edge-weights on the edges of each of T + and T~ 
be distributed as in the PWIT, independently for the two sides of T^~\ Then 
define a <7-finite measure A^ on := (0, oo) E by specifying that the weight 
W(— (ft, +4>) on the distinguished edge should have "distribution" uniform on 
(0, oo), independent of the other edge- weights. 
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-11 



+11 



Figure 6. Part of a realization w of edge-weights on the bi-innnite tree 

The point of this construction is that there is a natural bijection between 
W x {1,2,3,...} and which takes A x count to A"~\ In fact, if k is 

the distinguished vertex of T, then relabel vertices according to the rules 
relabel k as —(f) 

relabel j as +j for j < k — 1 and as +(j — 1) for j > k + 1 
relabel descendants accordingly. 

This relabeling induces a map ijj : W x {1, 2, 3, . . .} — > W M which is invertible. 
See figure 6. Checking that ip maps A x count to A*^ reduces to the following 
easy lemma. 

Lemma 25 Write A := {(xj) : < x\ < X2 < . . . , Xi —>■ oo}. Write Pois for 
the probability measure on A which is the distribution of the Poisson process 
of rate 1. Consider the map x : A x {1,2,3,...} — > A x (0, oo) which takes 
((xi),k) to ((xi,i ^ k),Xk)- Then \ maps Pois x count to Pois x Leb, where 
Leb is Lebesgue measure on (0, oo). 
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Remark. There is a natural map "reflect" from to W^, induced by 

the bijection of vertices +v «-► — v. The map 6 : W x {1,2,3,...} — > W x 
{1,2,3,... ,} featuring in the definition of spatial invariance (where we now 
ignore matchings) is related to tp as indicated in the diagram. 

Wx{l,2,...} ^ 

[0 [ reflect 



Wx{l,2, ...} %- W + 



In M, our Theorem |11| was stated in terms of matchings on T*~* instead of 
T. In this paper we switched to using T as the basic limit structure for two 
reasons: 

(i) on T we can define a random matching M. using probability distributions 
(instead of having to introduce cr-finite measures) ; 

(ii) the definition of spatial invariance (which in either setting involves cr-finite 
measures) is simpler on T than on T""*. 



In section [4.5| we reconcile the definitions. 

The relabeling used to define ip can also be used to define a family (X( e 
), e a directed edge of T*"*) in terms of the X(e) on the directed edges of the 



PWIT, constructed in Lemma 14. It is easy to check that the joint distri- 



bution of (W(e),X(e);e, e edges of T^) thus obtained is the same as if we 



applied the construction in Lemma 14 to (W(e),e G E*~*), replacing Eh in the 
construction by 

bE h = {e= (+v,+vj) : \v\ = h, j > 1} U (e= (-v, -vj) : \v\ = h,j>l}. 

(29) 

Then the matching M pt can be defined on T w in the same way as on T: 

e is an edge of M opt iff W(e) < X(e) + X(e). 
Lemma ^6] below shows how working on the bi-infinite tree is useful for calcu- 



lations. Informally, Lemma 26 describes the distribution of -M pt as seen from 
a typical edge in M. pt, and exhibits a conditional independence property for 
the restrictions of .Mopt to the two sides of the tree determined by that edge. 
On the PWIT define = mir^>\(W{(j),i) — X{<j>,i)), and write v x for the 

conditional distribution of the family (W(e),e € E; X( e), e EE, e directed away from 
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given X^ = x. Returning to the bi-infinite tree, write A 1 for the measure ob- 
tained by restricting A^ to the set {W{— cj), +<f>) < X{—<f>, +</>) + X(+<p, -(f))}. 
So under A 1 , in -M pt the vertex +<j) is a.s. matched with vertex —<p. 

Lemma 26 A 1 is a probability measure. Under A 1 we have: 

(i) the joint density of (W(— (j), +<f>), X{— (f>, +(p), X(+(f>, — <fi)) at (w,xi,X2) is 
equal to f (x±) f (x2)l(o <w<Xl+X2 ) , where f is the logistic density; 

(ii) conditional on (W(—<fi, +</)), X(—<j>, +<f>), X(+(f>, —(f)) = (w, x\,X2), the dis- 
tribution of the family 

(W(e),e E F, + ;X(e), eEE , e directed away from + <f>) 

is the image of ' u Xl under the natural embedding T — > T + C T^; the distribu- 
tion of the family 

(W(e),e E E~;X(e), e EE , e directed away from — <p) 

is the image of u X2 under the natural embedding T — > T~ C T^; and these two 
families are conditionally independent. 

Note that because +<f> is matched to — <j>, we have X(+j, +4>) = W(+(f>, —(f) — 
X (+(/>, —(f)) and then one can recursively construct X(e) for e directed toward 
(—(ft,+(J)). So the prescription in Lemma 26 is enough to specify the joint 
distribution of all the X(e) and hence of -M pt ; under A 1 . 

Proof of Lemma \2^. The joint density has the form stated in (i) by con- 
struction; and so its total mass equals J f(xi + X2) + f(x±)f(x2) dx\dx2 which 

equals 1 by Lemma | Next, by the construction based on (X(e), e^bE^) at 
(f29|), we see that under A^ the families 



{W(e),e E E + ;X(e), eEE , e directed away from +0} UX(— <fi, +0) 
and 

{iy(e),eeE";I(e),eGE ,~e directed away from - 4>} Ul(+^,-^) 

are independent of each other and of W{— <p, +4>). Since A 1 is defined by 
an event depending only on {X(+(j), —</>), X(—tf>, +4>), W(+<j>, —</>)}, we obtain 
the desired conditional independence property. Each family under A* - * is the 
distributed as the image of the corresponding family on the PWIT (making 
X(—(j), +(f>) correspond to X*-), and so by independence under A^ the condi- 
tional distribution under A 1 depends only on x\ (resp. x-i). □ 



32 



Recall the map ip : W x {1, 2, 3, . . .} — > which takes A x count to A M . 
The inverse image of the event {W(— 4>, < X {—</>, +<p) + X(+<f>, —</>)} is 

ip^iW '(-</>, +<j>) < X{-(j),+4>) +X(+(f>,-<f>)} = {M opt (0) is distinguished}. 

Thus the inverse image of the probability measure A 1 is A x count restricted to 
{M op t((j)) is distinguished}. Then after un-distinguishing this vertex, we are 
left with probability distribution A on the PWIT. 

So in summary, we have established the following "relabeling principle" . 

Given (Z(e)) and hence M pt on the PWIT, map the whole struc- 
ture to the bi-infinite tree by relabeling ((f), M.opt(4>) as (+</>> ~4> an d 
relabeling other vertices accordingly; then the resulting distribution 
on the bi-infinite tree is A 1 . 

To see why this is useful, let us give a quick second proof of Proposition |i~7|(a). 

The distribution of W(4>, M. op t(4>)) in the PWIT is the same, by the relabeling 
principle, as the distribution on the bi-infinite tree of W{— <f>, +0) under A 1 . 
Then by Lemma |26|(i) 




P(Xx + X 2 >w) 



for independent logistic X\ , Xi . 



4.3 Calculating with the bi-infinite tree 



On the bi-infinite tree define 



C* := {+<p = arg mm 
y — hi 



W(W(+I,y)-X(+I,y))} 



where 



/ = arg min(VF(+0, +i) - X(+(j>, 



a(X(e),W(e) :e,e edges of T+) 
a(X(e), W(e) : e, e edges of T~) 

<j(x(+0, -<p),x(-4>, +</>), w(+<p, -</>)). 
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Lemma 27 \ x {0*°^ , F*} = g(X (-</>, +</>),W(+<l>, -<j>) - X(+<f>, -</>)) for g 
defined in Lemma 

Proof. C* is ^""'"-measurable, so by the conditional independence assertion of 
Lemma ^6] we have 

\ 1 {C* c \F-,F t '} = X l {C* c \^}. 
Thus we have to show that 

A 1 {C* C |(^(-<A, +<f>),X(-<f>, +<t>),X(+c/>, -4>)) = (w, Xl ,x 2 )} = g( Xl ,w - x 2 ). 

(30) 

Under this conditioning, Lemma p6| implies that the family (W(e), e € E + ; X( e 
), eGE , e directed away from + 0) is distributed as the image of the family 

(W(e),e € E;X(e), e£E, e directed away from cf>) conditioned on {X^ = 
x\\. By definition of g(a, b) in Lemma 

g(x 1 ,w-x 2 ) = \ 1 {W(+<f>,+I)-(w-x 2 )> mmM(W(+I,y)-X(+I,y)) 

\{W(-<f>,+<l>),X{-<t>,+<f>),X(+<l>,-<j>)) = (w,xi,x 2 )}. 

But under this conditioning 

W{+<j),+I)-{w-x 2 ) = W(+(f>,+I) - (W(+0,-0) -0)) 

= W(+I,+0)-X(+/,+<£) 

by (|i~5|), because under A 1 the vertex +</> is always matched to —(p. Substituting 
into (|30|), we see that the event in ( |30[ ) is precisely the event C* c , as required. 

Proof of Lemma 111]. The relabeling principle shows that P(B*\B) can be 
rewritten as A 1 {C*|C}, for a certain event C which is .F^-measurable and such 
that P(B) = X l {C}. Now 

\ 1 {C* C \C} = E x i by Lemma [27 

> if A 1 {C} > by Lemma || 

establishing Lemma |22|. 



4.4 Remarks on quantifying Theorem 4 

Recall equation (|25[): the difference in cost between 7V4 pt and another spatially 
invariant random matching A4 equals EDI a- To quantify Theorem^], we would 
like to show 
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if P{M (</>) + MaptW) = S then ED1 A > e(8) 

with some explicit lower bound on e(5): our current "proof by contradiction" of 
Theorem [| shows only that e(6) > 0. Now it is not difficult to improve Lemma 
23 to a stronger result giving a lower bound on a corresponding quantity, under 
the same conditioning. Given some spatially invariant Ai, one can condition 
on the one-sided infinite path +</>, —<p, — J, ... in whose edges alternate 
between Ai op t and Ai, and seek to apply the improved lemma to the conditional 

distribution of the edge (+</>, Ai (+</>)). But the difficulty with this scheme is 
that the definition of Ai might depend on the whole tree; we cannot argue that a 
priori the behavior of Ai on T + and on T~ has some conditional independence 



property, as exploited in the proof of Lemma 27 



4.5 Reconciliation 

Here we reconcile the way definitions and results were stated in with the 
way they are stated in this paper. 

Write W* = W x {1,2,3,...}. The map 9 in the definition of spatial 
invariance is a map : W* — > W* which preserves A x count. Given a spatially 
invariant random matching Ai, define 7 : W* — > [0, 1] by 

7 (w, i) = P{M ((f)) = i\W{e) = w(e) Ve G E). (31) 

Then 7 must have the following two properties; the first because is a match- 
ing, the second because of spatial invariance. 

(i) Ei7(w,i) = 1. 

(ii) 7(0(w,i)) = 7(w,i). 

And we can write the associated cost as 

(iii) EW&M (0)) =/ w Ei u>(M7(w,») A(dw). 

It turns out that we can reverse the argument: given a function 7(-) satisfy- 
ing (i) and (ii) , one can define a spatially invariant random matching satisfying 
(|3T[), whose cost therefore satisfies (iii). Now use the bijection ip to define 

flf(w^) = 7(V>~ 1 (vO). 

Then g(-) satisfies certain consistency conditions corresponding to (i,ii), and 

the cost (iii) becomes 

(iii*) / w „<7(w~)A"(dw~) u 

Thus the quantity inf {EW((f>, Ai (</>)) : Ai spatially invariant} in our Theo- 
rem |ll] equals the infimum of (iii*) over functions g satisfying the consistency 
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conditions, and that was the formulation of the limit constant lim n EA, 
Theorem 1. 

Proposition 3(a), stated there for the optimal 



in H 



Our Theorem 13 



is 



n- 



matchings but extending unchanged to arbitrary spatially invariant matchings. 
Our Theorem 12 is obtained by combining Proposition 3(b) and Proposition 2 
of §; local convergence is the method of proof of those results. 



5 Variants of the random assignment problem 

From our asymptotic viewpoint, the non-bipartite matching problem - with 
n x n matrix (c(i,j)) with c(i,j) = c(j,i) and n even - is the same (up to 
normalization convention) as the bipartite problem, because the limit random 
structure and problem is exactly the same matching problem on the PWIT. 
Below we describe variants where the limit random structure or problem is 
different. 



5.1 Power-law densities 

Consider the random assignment problem 

n 

A n = min^c(i,7r(i)) 
n i=i 

where now the (c(i,j)) are i.i.d. with density f c { ) satisfying 

f c (x) ~ x r as x | (32) 

for some < r < oo. This setting is motivated by trying to "fit" mean- field 
models to the Euclidean matching problem on random points in [0, l] d , for 
which the distribution of inter-point distances satisfies (32) with r = d — 1: see 
[{12|| for further discussion. In this setting, EA n will grow as order n~, and so 
we rescale the problem to study A' n = n~~A n ; in other words 

n 

A 'n =mfn±£y(i,7r(i)) 
i=l 

where c'(i,j) = n 1// ( r+1 ^c(i, j). Mezard - Parisi ( ]l9p eq. (22,23)) use the replica 
method to argue EA' n — > j r where the limit constant is characterized by 

/oo 
G{l)e~ G{l) dl (33) 
-oo 

/oo 
^ (I + y) r e~ G{y) dy, -oo < I < oo. (34) 
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Let us indicate how our approach gives essentially the same result. Underlying 
the connection between the matrix with i.i.d. exponential (mean n) entries and 
the PWIT is the fact that the order statistics (£1 , £2 ) • • •) of n exponential 
(mean n) r.v.'s satisfy 

(&\&\...) ± (6,6,-..) (35) 

where the limit is a Poisson (rate 1) process. In the current setting, the order 
statistics (£,• ) of n r.v.'s distributed as c'(i,j) satisfy (|35|) where the limit (6) 
is now the inhomogeneous Poisson process of rate 

P( some point of (6) in [x,x + dx]) = x r dx. (36) 

Without checking details, it seems clear that following the method of [|| and 
this paper leads to the formulas which parallel (0,0) 



/•CO 

= / x • x r P(Xi + X 2 > x) dx (37) 
Jo 

X = min(6-X 4 ) (38) 

i 

where (6) is now the inhomogeneous Poisson process (|36|). The calculations 
below will check that (|33|,34) and (|37|,38) are essentially equivalent. Writing 

F(x) = P(X > x), (H) says 

F(x) = P( no point of (6, Xj) lies in {(z, b) : z — b < x}) 

= exp ( - f°° z r F(z - x) dz] . (39) 



Setting y = z — x gives 

F(x) = exp (- J (y + x) r F(y) dy 
Setting G{x) = — \ogF(x) gives 

poo 

G{x) = (y + x) r e~ G{y) dy 



which agrees with ( |34|) up to the constant factor 2/r!. [why the discrepancy? 
The 2 is the normalization convention mentioned in section 0. I guess the r! is 
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a typo at |T^] eq. (5), where the V should be V /r! as it is in [ 20 1 eq. (2.1).] 
Next, says 



roo poo 

/ x r+1 dx / P{X X G dy)P(X 2 > x - y) 

JO J-oo 

/oo roo 
P(X G dy) / x r+1 F(x - y) dx 
-oo Jo 

/oo poo 
F(y)dy / x r+1 F'(x - y)dx 
-oo Jo 

(integrating by parts over y) 

/oo poo 
F{y)dy (r + 1) / x r F(x — y)dx 
-oo Jo 

(integrating by parts over x) 



/oo 
F (y)(- log F(y)) dy by© 
-oo 

/oo 
G(y)e- G( ^ dy for G(y) = - log F{y) 
-oo 



and this is exactly (|33|). 

5.2 The combinatorial TSP 

In the combinatorial (i.e. symmetric mean-field) traveling salesman problem 
(TSP) we take a n x n matrix of "distances" c(i,j) such that (c(i,j), 1 < i < 
j < n) are i.i.d. exponential (mean n) and c(i,i) = and c(j,i) = c(i,j). We 
study 

L n := min i ^ c(i, ir(i)) 

i 

where tt is a cyclic permutation of {1, 2, . . . , n}. One can mimic the heuristic 



argument of section 3.2. That is, for the subtree rooted at v, a tour is 
a set of doubly- infinite paths which pass once through each vertex of T"; an 
almost-tour is the variation in which the root v is the start of one path in the 
set. Write (heuristically) 

(*) X v = cost of optimal tour on T v 

— cost of optimal almost-tour on T". 

One can continue the parallel heuristic to argue that the solution format of the 
combinatorial TSP is very similar to that for the random assignment problem 
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given by (|,|): 

POO 

lim EL n = / zP{Xi + X 2 >z) dz (40) 
n Jo 

where the distribution of X, X^ is determined by 

X = min' 2 ](£j — X{) where (&) is a Poisson(l) process; (41) 

i 

where min™ denotes the second-smallest value (so the only difference between 
the two solutions is the replacement of min by min^ ) . By copying the proof of 
Lemma [5] one can progress toward an explicit solution. Writing F{y) = P(X > 
y), ([Il|) says (in the notation of Lemma |5|) 

F(y) = P{ or 1 points of V in {(z, x) : z — x < y}) 

exp I — / F(u)du I . 

Now write G(y) = F(u)du so that the fixed-point equation (fll|) becomes 
the fixed-point equation 

G'(y) = (1 + G(-y)) exp(-G(-y)). (42) 

To rewrite (40) in terms of G, set z = x\ + X2- 

roc roc roc 

/ zP(Xi +X 2 > z) dz = / z dz F'(x 2 )F(z - x 2 ) dx 2 

JO Jo J-oo 



oo 
oo 



z dz I F'(z — xi)F(xi) dxi 

J — oo 

/oo /*oo 

F{x\) dx\ \ zF'(z — x\) dz 

-oo Jo 



-oo 
f oo 



/oo poo 
F(x\) dx\ I F{z — x\) dz 
-oo Jo 

F{xi)F(x 2 ) dx\dx 2 

xi+X2>0 

-G'(xi)G(-xi) dxi. 



Now use (fl^) to see that (40) becomes 

/oo 
G{x){l + G{x))e- G{x) dx. (43) 
-oo 
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So we can rewrite our solution (40,41) as (42,43), which is the solution given 



by Krauth - Mezard [15] using the cavity method. Earlier work [2C] using the 
replica method gave more complicated formulas. As in section 5.1 



one can 



consider the more general setting (32) and, after some calculus, our solution 
(with £j now defined via (|36D) again coincides with the general-r expression in 
[jTsfl . Our probabilistic expressions (4C.41;|j3|) perhaps makes the mathematical 
similarities between the heuristic solutions of TSP and the random assignment 
problems more visible than do the analytic expressions in the physics literature. 
In contrast to the random assignment case, the solution of identity (E^) seems 
to have no simple explicit formula. Numerically solving ([4^j43|) gives a limit 
constant of about 2.04 in agreement with Monte Carlo simulations of the 
finite-n combinatorial TSP, and the agreement persists for larger values of r; 
see Percus - Martin |2(| for a recent review. 

Returning to the bipartite setting, one can define a ^-assignment problem: 
study 

:= min \ V a 



— V 

nk / i '''J 



where S denotes an edge-set such that \{j : £ S}\ = k Vi and \{i 



€ S}\ = k Vj. A very similar heuristic argument indicates that lim n EA 



(k) 



should be given by the analog of (40,41) with fc'th minimum in place of second- 
minimum. 



5.3 Gibbs measures on assignments 

Fix a parameter < A < oo. Take (c(i,j),l < i,j < n) i.i.d. exponential 
(mean n). Define a non-uniform random permutation II n of {1, 2, . . . , n} by 



P(n n = tt\ all the c(i,j)) oc exp(— A c(i, vr(i))). 



(44) 



The statistical physics approach to the random assignment problem (and other 
combinatorial optimization problems over random data) is to first study such 
Gibbs measures and then take A — > oo limits. 

It seems plausible that our method extends to the study of the n — > oo, A 
fixed, limit behavior of this Gibbs measure, though it may be technically chal- 
lenging to make these ideas rigorous. In brief, regard the right side of (44) as 
specifying a random matching IIoo on the PWIT, and seek the density h\{x) of 
cost-x edges in the matching IIoo. To mimic the heuristic argument of section 
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3.2, then regard the right side of as specifying a "matching or almost- 



matching" Il + on T + , and define 

-P(I1 + is an almost-matching 



X, 



-P(I1+ is a matching 



One can argue heuristically that this Xa, should satisfy the distributional iden- 
tity 



/ oo 

d 



X ± [Y^e-^XA (45) 



\i=l 



for Poisson (rate 1) £i, and that the desired limit density is 

yi) = '- E (ui^)' <46) 

Similar ideas are implicit in Talagrand IpQl , who obtains some rigorous results 
for small A. Mezard - Parisi (|II| eq. (18-20)) derive a non-rigorous solution 



in a different form; presumably (cf. sections |5.1| and 5^) the two forms are 
equivalent but we have been unable to verify this. It is easy to verify that, 
writing X = e~ xu in (|45|) , expressions (45,46) are consistent in the A — > oo 



limit with (^,|3]). Formalizing our approach in the Gibbs setting seems more 
challenging than in previous settings, for the following reason. A central part 
of H is showing (Proposition 2) that given an almost-complete matching, one 
can construct a complete matching for small extra cost. Proving an analogous 
result in the Gibbs setting will presumably require the same type of technical 
estimates needed in 



6 Final remarks 

6.1 The AEU property in random optimization 

One can define the AEU property in quite general "optimization over ran- 
dom data" settings. Consider for instance the Euclidean TSP involving ra i.i.d. 
points on [0, l] 2 . Writing L(T n ) for the length of the shortest tour, we have 
H EL(T n ) ~ era 1 / 2 for constant c. The AEU property would be: for non- 
optimal tours T' n , if En~ x #{e : e G T n \ T^} > 5 then liminf n ra" 1 / 2 (SL(T^) - 
EL(T n )) > e(5) where e(5) > 0. Whether or not the AEU property holds for 
the Euclidean TSP is a challenging question. Conceptually, the AEU prop- 
erty seems an interesting way of classifying optimization problems from the 
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average-case viewpoint and seems worthy of further study. Note it is an in- 
trinsic property of the problem, not of any particular algorithm to solve the 
problem. 



6.2 The exact conjecture 

Our proofs are purely asymptotic, so do not shed light on the exact conjecture 
(||). Note (cf. proof of Proposition [l?]) that h(x) represents the asymptotic 
chance than a cost-x edge gets into the optimal matching, so in particular a 
cost-0 edge has asymptotic chance h(0) = 1/2 to be in the optimal matching, 
as suggested by previous work (p3|, [p^| sec. 4.1). 

It is very natural to conjecture that var A n ~ a 2 /n for some < a < 
oo, and that rescaled A n has a Normal limit: this is supported by Monte 
Carlo simulation [|l^]. One can define a candidate value a in terms of the 
optimal matching on the infinite tree, which (roughly speaking) reflects local 
dependence, but it is not apparent even heuristically whether a should coincide 
with a. 

Parisi (2l| observes that EA\ ~ EA n +l for small n. The implicit conjecture 
is an amusing instance of how numerics can mislead; what's really going on is 

EA 2 n k, (EA n ) 2 for large n 

and 7r 2 /6 just happens to be close to the solution of z 1 = z + 1. 



6.3 Frieze's £(3) result 

It is intriguing that the limit ("(3) arises in the somewhat analogous prob- 
lem of the minimum-weight spanning tree on the complete graph with i.i.d. 



edge- weights (Frieze [10Q. That problem can also be studied using the PWIT 
(Aldous |l[]). The density function h{x) with mean £(3) analogous to h(x) is 

h{x) = \{l-q\x)) 

where q(x) = on < x < 1 and for x > 1 is the non-zero solution of 

1 — q(x) = exp(— xq(x)). 

Thus we see the same structure: calculations of asymptotic quantities involve 
a fixed-point identity. It is natural to speculate that some other problem has 
solution C(4). 

Acknowledgements. I thank Greg Sorkin, Boris Pittel and Michel Talagrand 
for discussions and encouragement. 
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